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We summarize some aspects of electrostatic interactions in the context of viruses. A simplified but, within well defined lim- 
itations, reliable approach is used to derive expressions for electrostatic energies and the corresponding osmotic pressures in 
single- stranded RNA viruses and double- stranded DNA bacteriophages. The two types of viruses differ crucially in the spatial 



distribution of their genome charge which leads to essential differences in their free energies, depending on the capsid size and 
total charge in a quite different fashion. Differences in the free energies are trailed by the corresponding characteristics and 
variations in the osmotic pressure between the inside of the virus and the external bathing solution. 
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Qh Viruses are abundant and ubiquitous 1 and it is possible that 
there are no forms of life immune to the effect of viruses 2 , 
which may be advantageous in the fight against diseaseEHS. 
It appears that there are even viruses that initiate their "lifecy- 
cle" exclusively in combination with some other viruses, often 
"stealing" the protein material of those viruses and diverting 
the cellular processes they initiated to their own advantage03- 
<-h the viruses are thus parasites even of their own kin. Although 
Q H we know the exact nature (the shape and the genome) of only 
about a hundred^ viruses, it seems that they are almost as di- 
verse as life itself, so that they represent a type of index to the 
library of life forms that they parasitize upon. The viruses are 
indeed only abbreviated, indexed, crippled representation of 
life and they can hardly be classified as life. They are most 
often viewed by physicists (or "physical virologists' ^SQSIQ 
as hetero-macromolecular complexes, i.e. complexes of vi- 
ral proteins and the genome molecule (DNA or RNA) that are 
reasonably stable in extra-cellular conditions and that initiate a 
complicated sequence of molecular interactions and transfer- 
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mations once they enter a suitable cell 13 . According to such a 
view, a virus must in its structure somehow "encode" the cru- 
cial steps of its replication process. For example, the proteins 
that make its protective shell (virus capsid) must have such ge- 
ometric and chemical characteristics as to activate the appro- 
priate receptors on the cell membrane so that they can attach 
to and penetrate its interior 14 15 . The reverse of this process 
when mature viruses are released from the infected cell im- 
plies membrane adsorbtion and budding 16 ! 17 1 to a large extent 
promoted by electrostatic interactions 18 . The virus needs to 
be sufficiently stable in the extracellular conditions, yet suf- 
ficiently unstable once it enters the cell, so that it can disas- 
semble and deliver its genome molecule to the cellular repli- 
cation machinery 19 . Once it fulfills walking this tight rope of 
incipient instability, the manufacturing of virus components 
in the cell proceeds, leading eventually to new viruses^. It 
is certainly of interest to elucidate the nature of interactions 
in viruses that enable it to function "between a rock and a 
hard place", equilibrating on the border of stability, not just 
from a fundamental scientific point of view but also techno- 
logicallyEEa. 

In this review, we shall concentrate on the description of 
virus structure in terms of the electrostatic interactions, i.e. 
we shall be interested in the corresponding energies of single 
viruses (e.g. the energy required to assemble a virus from its 
constituents) and osmotic pressures acting in a virus, though 
electrostatic interactions are just as important for understand- 
ing the interactions between the virues^. In fact, the rele- 
vance of strong electrostatic interactions for the stability of 
tobacco mosaic virus (TMV) gels was invoked already in the 
seminal work of Bernal and Fankuchen 2z Q while the Poisson- 

f They refer to electrostatic interactions as being "probably due to the ionic 
atmospheres surrounding [viruses]". 
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Botzmann theory of electrostatic interactions was applied to 
the case of viruses^ soon after its publication 26 and even be- 
fore it was applied to lipid membranes^. 

We shall examine the relation of the formation energy and 
osmotic pressure of a virus to its structure. Our emphasis is 
on the electrostatic interaction, first because it contributes a 
significant part to overall energetics of the virusesEHK] but 
also because the electrostatic part of the free energy is the 
part that can be calculated with a fair amount of precision^. 
Subtler properties of viruses, such as the nature and extent of 
the ordering of packed DNA/RNA molecul d 33 ! 34 ! may depend 
on other physical effects, such as genome molecule entropy, 
yet an important contribution to the energy of protein-genome 
packaging is of electrostatic nature^. 

The aim of this work is to show the usefulness of elemen- 
tary physical concepts describing electrostatic interactions as 
they apply to viruses 36 . Although we will explain some of the 
intricacies involved in the modeling of viruses, our emphasis 
will be on the essential physics involved. Although the sim- 
ple (and necessarily approximate) expressions that we shall 
expose and employ in this review can be derived from pre- 
vious, more elaborate publications, we intend to use mostly 
scaling-based reasoning in their derivation and application in 
the context of viruses. 

Electrostatic interactions in the context of viruses are com- 
plex and bring together various theoretical approaches ex- 
tending from the theory of polyelectrolytes^^ and th en all 
the way to the theory of highly charged Coulomb fluids®^, 
sometimes exploiting elaborate theoretical concepts and for- 
mally demanding approaches. In this review we shall thus 
try to retain only most easily understood and applicable con- 
cepts^, yet sufficiently reliable when it comes to descr iption 
of biological system in general and viruses in particular^MI 
This approach necessarily excludes the details of some of 
the more arcane aspects of the theory of electrostatic inter- 
actions in aqueous solutions (non-linear salt screening, effects 
of polyvalent counterion correlations, overcharging by the mo- 
bile charge and similar^). We do explain the essence of these 
aspects and their possible relevance in context of viruses in 
Section O 

We shall emphasize the non-specific aspects of the contri- 
butions electrostatic interactions and will thus avoid a detailed 
exposition of the ion- specific effects " 6 " 7 ! This means that we 
do not deal with chemical specificity of different ions that 
may drastically modify electrostatic interactions. These ef- 
fects may be of importance for initiation of assembly, or for 
speeding up of the assembly, but they are not of primary inter- 
est to this review. Our point of view concerns the more robust 
aspects of virus energetics that can be understood in a suffi- 
ciently generic (and simple) physical framework. 




(1,1) (2,0) (2,1) (3,0) 

Fig. 1 The geometry behind the Caspar- Klug (CK) classification of 
viruses. Icosahedral viruses that obey the CK principle can be "cut 
out" of the lattice of protein hexamers, as shown in the figure. Upon 
folding of the cut-out piece, twelve of protein hexamers are 
transformed in pentamers. The CK viruses are described with two 
integers, h = 2 and k = 1) in the case shown, which parametrize the 
vector A. The T-number of the capsid is related to h and k as 
T = h 2 + hk + k 2 , and the number of protein subunits is 607; see 
RefsJSEHfor details. The bottom row of images displays the CK 
structures with T = 3,4, 7, and 9 (from left to right). 

2 A simple description of a virus 

The emphasis of this review is not on the symmetry and shapes 
of viruses - there are already many good reviews on this sub- 
ject (see e.g. Ref. 6 ). For our purposes it is enough to state that 
all viruses are made of two essential parts: protein coating or a 
capsid and viral genome (of DNA or RNA type) situated in the 
capsid interior. There are also viruses that in addition to these 
two essential components need an additional "wrapper", i.e. a 
piece of cellular membrane, in order to function properly and 
fuse with the cellular membrane surface^. These viruses are 
referred to as enveloped (in contrast to non-enveloped viruses 
which do not have a membrane coating). Because of severe 
restrictions on the length of their genome encoding the viral 
shell proteins, the virus capsids are made of many copies of 
one or at most a few types of proteins which are arranged in 
a highly symmetrical manner as first proposed in the seminal 
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work by Crick and Watson^. 

Nearly spherical viruses, also called icosahedral viruses, 
show mostly but not always 50 icosahedral order and the pro- 
teins that make them can be arranged in the clusters of five 
(pentamers) or six (hexamers; see Fig. [I] for details and defi- 
nition of Caspar-Klug classification) This arrangement may 
be only conceptual, but may also have a physical meaning 
that the interactions in clusters (capsomeres) are som ewhat 
stronger than the interactions between the clusters^El. Crick 
and Watson surmise that nearly all viruses can be classified 
either as nearly spherical, i.e. of icosahedral symmetry, or 
elongated of helical symmetry^. 

Ico sahedr al viruses tend to look more polyhedral when 
l ar g ei j6]52|53] There are a j so non-icosahedral viruses that do 
not fit in CK classification. Here are some prominent ex- 
amples: Capsids of some bacteriophages (viruses that infect 
only bacteria) are "elongated" (prolate) icosahedral, i.e. the 
icosahedral sides around the equator are not equilateral, but 
isosceles triangles^. Capsids of some plant viruses (e.g. to- 
bacco mosaic virus) are (open and hollow) cylinders and their 
genome molecule is situated in the empty cylindrical space 
formed by proteins. HIV virus is also non-icosahedral, but 
is not an elongated icosahedron^l Its capsid typically looks 
conical, being elongated and narrower on one side^. Further- 
more, even when the viruses are spherical, it is sometimes dif- 
ficult to classify them according to CK scheme and the typical 
pentamer-hexamer ordering is not Some viruses 

are multi-layered, i.e. they consist of several protein capsids 
each of which may be built from different protein^. Each of 
these capsid layers may individually conform to the CK prin- 
cipleES An alternative to CK classification has recently been 
proposed that apparently contains the CK shapes as the subset 
of all possible shapes, including those that do not show a clear 
pentamer-hexamer pattern 58 ! 59 ! 

There is a certain universality in the size of capsid proteins. 
By analyzing more than 80 different viruses (with T numbers 
from 1 to 25), we have found that the area of a protein in a 
capsid is fairly conserved and amounts to ~ 25 nm 2 ^. The 
thickness of the protein, i.e. the thickness of the virus capsid 
in question varies more, but is typically in the interval ~ 2 - 5 
nm. The "typical" virus protein can thus be imagined as a disk 
/ cylinder of radius ~ 3 nm and thickness ~ 3 nm. In some 
viruses these "disks" have positively charged protein "tails" 
that protrude in the capsid interior and whose role is to bind 
to a a negatively charged genome molecule (typically ssRNA; 
see Sec. [4] and Ref. 63 ). There is some universality in the dis- 
tribution of charges along and within the virus capsid, Fig. 



While Caspar-Klug dipoles, corresponding to a bimodal distri- 
bution of positive charges on the hypotopal side and negative 
charges on the epitopal side of the capsid can be observed^, 
and we recognize them also for the CCMV capsid in Fig. [2| 
this is certainly not a rule and other, e.g. monomodal, dis- 
tributions can be observed just as well. The in-plane angular 
distribution of charges along the capsid thickness also shows 
complicated variations within the constraints of the icosahe- 
dral symmetry group^, see Fig. [2] Last but not least, the 
magnitude of the charges on the surface of the capsomeres is 
regulated by the dissociation equilibrium while for the buried 
charges it would have to be estimated from quantum chemical 
calculations^^. 

Thus far, we have talked about the order of virus proteins 
and we have said nothing about the distribution and ordering 
of the virus DNA or RNA molecule inside the capsid. This 
will be discussed in the following sections. The virus genome 
molecule codes for the proteins of the capsid, but also for other 
proteins needed in the process of virus replication, depend- 
ing on a virus in question. ssRNA viruses need to code for 
protein that replicates the virus ssRNA and some viruses also 
encode the regulatory proteins that are required for correct as- 
sembly (scaffolding proteins, see e.g. Ref. 68 ) and the proteins 
required for release of viruses from the infected cell™ The 
amount of information that is required constrains the length of 
the genome molecule from below. 

3 Self-assembling viruses and their energies 



* There are also viruses with icosahedral order, yet containing only pentamers. 
Such is the polio virusl^ 

X The iso-surfaces obtained here were calculated by first assigning one elemen- 
tary positive charge to charge residue of each lysine and arginine amino acid 
and protein N-termini, one negative charge to charge residue of each aspar- 



Many viruses can self-assemble^ though the details of the as- 
sembly pathways are seldom well understood^. This means 
that the "ingredients" for a virus, individual proteins and 
its genome molecule, can spontaneously form (assemble in) 
closed, functional viruses, even outside the cellular environ- 
ment, in conditions of appropriate pH factor and salt con- 
tic acid and glutamic acid and protein C-termini, and 0.1 positive charges to 
histidine charge residues (this corresponds to the histidine fractional charge at 
neutral pH). We thus obtained the 3D coordinates of charges in the capsid, yet 
this discrete data does not give a visually clear insight in the three-dimensional 
nature of the charge distribution. To obtain an informative visual representa- 
tion, we used this data to construct spatial scalar fields of positive and negative 
charge distributions (separately), by assigning to each charge residue a scalar 
density, x(r) = \q/eo \ {l — [min(r, W)/W] 2 } 2 , where q is the charge of the 
residue, eo is the elementary charge, r is the distance from the charge residue, 
and W is the parameter specifying the extent of the density. The total den- 
sity field was constructed by summing the scalar densities of all the charges 
(separately for positive and negative distributions). From the thus obtained 
space field, an isosurface showing the value of the field at t is plotted. The 
objects obtained in this way are known in the computer graphics community 
as metaballs or blobs 64 . In Figs. [|]and[5] W = 1.34 nm and t — 0.85. Amino 
acids in the protein interior, i.e. in the capsid wall may not carry charge, as 
we have assumed, but due to the nature of graphical representation we have 
chosen, panels a) and b) of Fig. [2](but not panel c) would ad equately repre- 
sent the (surface) charge distributions also in the case when only amino acids 
sufficiently close to the capsid surface were charged. 
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Fig. 2 The calculated representation of the charge distribution on the capsid of cowpea chlorotic mottle (ssRNA) virus as explained in the 
text, based on the 1CWP entry in the RCSB Protein Data Bank: a) The isosurface of positive charge (red), b) The isosurface of negative 
charge (blue), c) The combined isosurfaces of positive and negative charges shown in the capsid cut in half so that its interior is seen. On the 
lef-hand side of image in panel c) (the left of the white vertical line), the (cut) isosurface of negative charge (blue) is translated infinite simally 
closer to the viewer, while it is the opposite on the right-hand side of the image. 



centration. This is typical for viruses that contain ssRNA 
molecule and it was first demonstrated in tobacco mosaic virus 
by Fraenkel-Conrat and Williams 71 . They were able to pro- 
duce infectious virus particles by simply mixing two solutions, 
one of them containing only virus proteins, and the other virus 
RNA molecule. 

The process of self-assembly can proceed spontaneously 
only if the free energy (F) of the assembled virus is lower 
than the free energy of the disassembled state. This means 
that, some viruses at least, can be viewed as thermodynami- 
cally optimal structures, i.e. they represent minima of the free 
energy^. 

Upon assembly of proteins and RNA in a virus, their en- 
tropy (S) decreases, so the process can not proceed sponta- 
neously unless there is a gain in the internal energy (U) of the 
system, i.e. there is a favorable "binding" energy of ingredi- 
ents once they form a virus. Even then, the process of assem- 
bly can proceed only when the concentration of the ingredients 
is sufficiently large, i.e. above a critical concentration required 
for assembly^. Below that concentration, the entropic contri- 
bution to the free energy dominates and although the binding 
energy (enthalpy) of the ingredients in a capsid is favorable, 
the net gain in free energy is not, and the proteins and RNA 
molecules remain in a disassembled state. 

Not all viruses can self-assemble. Some types of viruses, 
even when all the ingredients are available, can form only in 
the cellular environment, i.e. they require some of the cellular 
mechanisms for the assembly. The typical requirement is ATP 
energy which suggests that those viruses do not correspond to 
the simple free energy minima, but are rather examples of free 
energy driven structures at some elevated position/plateau in 
the free energy landscape. The use of ATP energy for assem- 



bly is typical for bacteriophage viruses as we shall see in the 
following. 

3.1 Energies and assembly of empty viruses 

The viruses are kept together by the same interactions as those 
governing the "living matter" ^ 74 ! 75 i The interactions are thus 
many different guises of electromagnetic force, sometimes as 
"direct" interactions between entities (e.g. electrostatic, van 
der Waals (vdW), steric repulsion interaction,...) 76 and some- 
times as "indirect" or effective interactions which manifest as 
forces but arise only in presence of the bathing medium at 
finite temperature, T (e.g. hydrophobic or hydration interac- 
tions)E2Ell xhe direct interactions are also modified by the 
presence and nature of the bathing medium, yet they survive 
outside it. 

A very important characteristic of virus genome molecules 
is the negative charge they carry, due to dissociation of phos- 
phate groups on RNA and DNA bases 79 . A compactification 
of a highly negatively charged genome in a capsid interior re- 
quires energy. That is why the proteins in contact with the 
DNA or RNA are often positively charged. This may not be 
the case for the protein as a whole, but it is quite often the case 
for parts of the surface of proteins in direct contact with the 
DNA or RNaEM2 ( see Fig. [5J note that the capsid interior is 
mostly red, i.e. positively charged). These parts of proteins are 
often spatially extended and they in some cases form highly 
positive "tails", see next section. The complementary charges 
on the viral proteins and genome increase the electrostatic in- 
teraction of the complex and decrease its energy, enabling an 
easier assembly. That is the reason the viral genome codes 
for such proteins, but there are viruses (e.g. polyomavirus) 
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which utilize cellular, very positively charged proteins (his- 
tories) to associate them with their genome so as to reduce the 
electrostatic energy and allow for an easier packing 14 . All this 
indicates the importance of electrostatic interactions for virus 
assembly. 

The charge on the proteins (and on the DNA/RNA) depends 
on the pH factor of the solution. Modifications of pH factor, 
and the corresponding dissociation/saturation equilibrium of 
ionic bonds in proteins leads to variation of their charge^MU 
and a change in the electrostatic energy of the nucleopro- 
tein complexes, as will be seen in the following. Modifica- 
tion of the pH factor may occur in the cellular environment 
in endosomes/vesicles i.e. pieces of cellular membrane that 
the viruses carry along with them as they penetrate the cell. 
Some viruses assemble in specific subcellular compartments 
(called "virus factories" 82 or inclusion bodies^) which may 
also have an adequately altered, local pH value. 

3.1.1 A simple model of a virus capsid and the corre- 
sponding energy. The simplest representation of the protein 
charge distribution would be a positive charge homogeneously 
distributed on a protein. A homogeneous distribution of pos- 
itive charge will tend to keep proteins apart, yet it has been 
experimentally observed that proteins of some viruses assem- 
ble in (empty) capsids under appropriate conditions. Such is 
a virus of hepatitis B that has been experimentally studied in 
considerable detail by A. Zlotnick and coworkers 51 . Since 
the viral proteins do assemble in empty capsids, there must be 
an attractive contribution to their interaction. The source of 
this contribution is a combination of hydrophobic and van der 
Waals interactions 77 , with hydrophobic component playing a 
dominant role. This conclusion can be read out from the ex- 
periments performed by Ceres and Zlotnick 51 . They observed 
that the strength of protein-protein interactions in capsids in- 
creases with temperature and this suggests an important en- 
tropic contribution to the interaction, hydrophobic interaction 
being the obvious candidate^. 

To understand the cohesive energy of a virus capsid, we first 
need to evaluate a seemingly simple problem: obtain the elec- 
trostatic self-energy of a uniformly (positively) charged (with 
surface charge density a), permeable, infinitely thin sphere of 
radius R - this is the zeroth level description of a capsid. This 
problem can be solved on the mean-field level by treating the 
ions as ideal gas, which adjust to the external potential and 
contribute to it via their charge density. This is the Poisson- 
Boltzmann (PB) approach which yields nonlin ear differential 
equation for the electrostatic potential, (pZESDS] j t can ^ e 
rived by minimizing the appropriate free energy 84 which has 
the form 



F PB to{r)M(r),(*{r)] = J f PB ^(r), V*{r),c*(r))d 3 r, 



where the free energy density is given by 

/ PB H>(r),V4)(r),c ! '(r)) = - ±e eV(Kr) 2 + £ e i c i (r)^r) + 

;=± 

+ £ k B T [c'»lnc'» -c\r) - (c^lncj, "4)] + 

i=± 

+ e p P (r)(Kr). (2) 

Here eop p (r) is the charge density of the capsid, T is temper- 
ature, Icb is the Boltzmann constant, c l are the concentrations 
of (monovalent) salt ions, with c l their bulk concentrations, 
££n is the permittivity of water, and eo is the electron charge. 
Minimizing the above free energy w.r.t. <|>(r), V<|)(r) as well as 
c l (r), leads to the PB equation of the form 

-eeoV 2 4>(r) = £ ^" MW +^oP P (r), (3) 

i=± 

where (3 _1 = ksT and e{ is the charge of the ions, i.e. ei = ±eo- 
When the electrostatic potentials in the solution are small, 
£oP(|> <C 1 and we are dealing with a symmetric system, e.g. 
1-1 electrolyte, that has = < 



Cq = Co, the PB equation can be 
linearized yielding the Debye-Hiickel (DH) equation for the 
potential 85 , of the form 



-V>(r) = 




(4) 



where we took into account that the salt is assumed to be uni- 
univalent and thus Y<i=± e i c o — 0- At this point one standardly 
introduces the inverse (Debye-Hiickel) screening length K _1 , 
with k 2 = p (£ i=± ejc^) /eeJSEI 

The linearity of DH equation renders it amenable to several 
ways of solving, including the Green function method^. But 
the simplest way to think of the DH approximation is in terms 
of the renormalization of electrostatic interactions (in vacuo) 
by salt ions. The effective interaction between the charges Q\ 
and 22, separated by ri — r 2 in the solution of monovalent 
ions (with concentration en) of (relative) dielectric constant 8 
is given by the DH potential of the screened exponential form 
U(y\ — r 2 ). The easiest way to obtain the self energy of a 
uniformly charged sphere in the DH approximation is to sum 
the pair DH interactions over the sphere surface, 

Fdh — ~2 J dS ^ j dS 2 U(ri - r 2 ) = 



1 



2 47t£ £ 



dS 2 



exp(-K|ri -r 2 |) 



r l- r 2 



(5) 



(1) 



where dS\ and dS 2 are infinitesimal elements of the sphere 
surface centered around vectors ri and r 2 , and factor of 1/2 
accounts for double counting of the pair interactions. Since 
the radii of viruses are typically of the order of 20 nm, and the 
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DH screening length in the physiological conditions (co ~ 150 
mM) is K _1 ~ 1 nm, the limit of kR ^> 1 is often implied. The 
range of integration is effectively cut on the scale of K _1 . In 
case of interactions on a sphere, this defines a spherical cap. 
But, when kR ^> 1 , this spherical cap transforms in a disk, and 
this renders the two integrations independent: 

a 2 1 f f°° f 2n 

lim F DH = / dSi / dr 2 / # 2 exp(-Kr 2 ). (6) 

k/?>i 47i8 e2 t / Jo Jo 

Note that the integration over r 2 can be extended to infinity, 
since K in the exponential function acts as a cutoff parame- 
ter^l. The self-energy of the capsid is thus^ 



lim F DH 

K/?>1 



%<5 2 R 2 
8 £K 



(7) 



We have denoted the self-energy using letter F to indicate that 
it corresponds to free energy, containing also the entropy of 
salt ions. One should note the meaning of this quantity: it is 
the energy required to bring infinitesimal charges from infinite 
separations in the solution to the capsid. When the capsid con- 
sists of many weakly charged proteins, one may think of this 
quantity as being approximately the electrostatic contribution 
to the assembly free energy. To estimate this quantity, we need 
an information on the surface charge density of the virus pro- 
teins. This can be estimated from their amino acid content, 
and typically a ~ 1 eo/nm 265 . Taking now R ~ 20 nm and 
c = 100 mM, we obtain F DH ~ 10 4 k B T. 

The hydrophobic energy can be estimated from the area of 
proteins engaged in protein-protein contacts, i.e. from the vol- 
ume slice around the proteins one water molecules thick. The 
capsid of 20 nm typically have T = 3 triangulation number, 
i.e. they consist of 180 protein subunits. The length of protein 
contacts in such a capsid is about 3000 nm. This gives the 
total area exposed to the protein contacts of about 6000 nm 2 
if we assume the capsid thickness of 2 nm. The energy of at- 
tractive protein interactions (hydrophobic and van der Waals) 
per unit area of the exposed protein surface is typically of the 
order of 10 mJ / m 2 ^. Multiplying the estimated exposed 
area with this energy, the estimated attractive hydrophobic in- 
teraction is obtained as, Fhp ~ 10 4 ksT. This is of the same 
order of magnitude as the electrostatic repulsion and one is 
led to conclude that the interactions tending to dissolve a cap- 
sid and those keeping it together are in a tight balance (Ceres 
and Zlotnick have measured the free energy of the hepatitis B 
capsid to be — ~ 1.5 10 3 ksT i.e. about 5-6 ksT per inter- 
protein contact 51 ). The reason for this is most likely the fact 
that viruses, in addition to being able to assemble, need also 
to disassemble and deliver their genome molecule to the cell. 
In simple viruses this may be triggered by variation of pH and 
ionic concentration in different cell regions which increase the 
electrostatic repulsion in the capsid. Sufficiently large changes 



in pH lead to variation (increase) of protein charge 89 , i.e. a, 
while reduction of ionic concentration leads to decrease of K, 
and as Eq. ^ shows, both effect modify the electrostatic self- 
energy of the capsid. Ceres and Zlotnick ^ have experimen- 
tally demonstrated that the free energy of hepatitis B capsids 
increases with concentration of mono-valent salt. This may be 
interpreted as the screening of repulsive protein-protein elec- 
trostatic interaction 6590 . A sufficiently large modification of 
the electrostatic interaction may lead to enthalpic instability 
of the capsid and to its disa ssembly 87 . Experimental stud- 
ies of virus assembly 51 1 9 ^ 92 l clearly show the importance of 
pH factor and the salinity of the solution for the assembly of 
complete capsids. For sufficiently large concentration of virus 
proteins, the assembly depends on the values of both of these 
parameters^ 91 ! 92 !. 

3.1.2 Refined models of virus capsids. The simplest ap- 
proach to capsid electrostatics presented above may be im- 
proved in several respects. The easiest one is to examine the 
complete DH solution to the problem, i.e. in a whole range 
of kR values, valid in particular for lower ionic concentrations 
when K < R. In fact one can derive a general formula^ 



Fdh 



2%g z R 



2r>2 



(8) 



8 £K[1 +coth(Kfl)] 

that describes the capsid electrostatic free energy for any K 
within the range of validity of the DH approximation.. 

A next level of refinement is to abandon the assumption of 
smallness of electrostatic potential $eo§ <^ 1 and to solve the 
PB equation, Eq. [3] for the potential and obtain the capsid en- 
ergy in this way. This is still not the "exact" solution to the 
problem, since the PB approach is an approximation of the 
mean-field genre and it neglects ionic correlations^. Yet, the 
PB approach is more reliable for large surface charge densities 
and smaller ionic concentration, i.e. in cases where (|) is not 
necessarily small. In the limit P^ <C 1 it of course reduces 
back to the DH limit. The detailed study and comparison of 
PB and DH approaches to capsid electrostatics has been per- 
formed in Ref. 90 . The PB capsid energies are always smaller 
from the corresponding DH values, yet the functional depen- 
dence of F on a, R, and co is similar in both approaches, the 
R 2 dependence in particular. The DH results are very reliable 
quantitatively when R ~ 20 nm, a < 0.8 eo/nm 2 and co > 50 
mM. 

The calculations that adopt a refined representation of the 
capsid with regards to its finite thickness, 8, have also been 
performed. The distribution of charge across the thickness 
of the capsid is such that the positive charges are often con- 
centrated on the capsid interior surface, while, typically neg- 
ative charges are concentrated on the capsid exterior surface 
(see Fig. |2j. There are also a few but functionally non- 
negligible ^net charges embedded in the interior of the cap- 
sid proteins^!], while most of the charges there are partial 
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charges due to electronic charge redistribution in chemical 
bond formation! 67 ! 95 !. To account for the finite thickness of 
the capsid, it was treated as a dielectric shell with relative per- 
mittivity E p , impermeable to ions, with interior and exterior 
surfaces which are uniformly charged 90 with surface charge 
densities Ci and c>2, respectively (see Fig. [3]). This model 
shall still prove useful, so we discuss it briefly in the follow- 
ing. 




Fig. 3 An illustration of an electrostatic model of a viral capsid with 
finite thickness, 5. Salt ions are represented by small spheres. The 
interior of the capsid contains water and salt ions, but the salt ions 
are not present in the viral capsid shell. 

Intriguingly, for this model also, in the regime kR ^> 1 the 
capsid free energy scales with the second power of capsid ra- 
dius, both in the DH and PB cases^. DH expressions for 
the electrostatic potential and free energy can be obtained an- 
alytically, yet they are sufficiently transparent only in certain 
limits. A limit of interest to us is kR ^> 1 
this case, 



8 > e p , 8 <C R. In 



F D// (oi,o 2 ,8)=27C^ 



e p (oi + a 2 ) 2 + e (of + of) K§ 

£ £K(2£ p +£k8) 



(9) 



For viruses in physiological conditions, we may further take 
E p <C £ (for proteins, E p ~ 5^), and kS ~ 1, which simplifies 
Eq. 0) to 

2,«,j + sj)«* 

£o£K 

By comparing this equation with Eq. [7} one concludes that 
the qualitative behavior of the capsid free energy is not impor- 
tantly modified by an introduction of the finite capsid thick- 
ness and that the free energy is of the same order of magnitude 
as in the case of infinitely thin shell. 



Angularly nonuniform distribution of the capsid charge 
(still positioned on a perfectly spherical, infinitely thin cap- 
sid), in accordance with its icosahedral symmetry, can be in- 
troduced in the electrostatic model. This complicates the treat- 
ment and introduces special function series, but the electro- 
static free energy still retains the already familiar functional 
DH behavior in the kR ^> 1 limit. In particular, Marzec and 
DayL^have obtained that 



%I? Q R 2 
£ £K ' 



(ID 



where 1? Q is the average square charge density, 1? Q = 
(4n)~ l f d£2Lg(£2), and £1 is the spatial angle. 

A completely numerical PB approaches are also possible. 
These start with a determination of the spatial distribution of 
capsid charge based on the capsid amino acid content and the 
atomic coordinates determined from X-ray studies^. The PB 
equation is then solved on a three-dimensional grid using ad- 
vanced numerical routines^. Such approaches are, however, 
less transparent concerning the scaling of energies with vari- 
ous parameters of the system, e.g. virus radius, its total charge, 
salt concentration, and similar. 



4 Energies of ssRNA viruses 

Viruses that contain the single- stranded RNA molecule (ss- 
RNA) often self- assemble^. A simplified description of the 
ssRNA molecule in these viruses characterizes it as a generic 
flexible poly electrolyte 81 98 . The ssRNA flexibility is de- 
scribed by its effective persistence length i.e. the length on 
which the ssRNA refuses to bend. For flexible polyelec- 
trolytes, this effective length is of the order of monomer (nu- 
cleotide) separation (a ~ 0.5 nm) and they can thus be well 
described within the framework of the Edwards - de Gennes 
flexible chain mode P 9 * 100 *. It is the connectivity of the chain 
that gives an essential imprint to the behavior of monomers 
of the chain as opposed to free particles in solutionis. The 
long range interactions between monomers, such as Coulomb 
interactions in the case of ssRNA, are modified in an essential 
way when coupled to the connectivity of the chainL^l The 
main difference between the flexible chain model describing 
the salient features of the ssRNA chain and the semiflexible 
chains, such as dsDNA, is that the elasticity of the chain is 
purely ent ropic^^ in the former while being enthalpic in the 
latter cas d 104 ! 10 ^. The connectivity of the chain introduces 
many important features also in the behavior of the chain in 
external fields as is the case in adsorption to charged sur- 
f aces LL06] an( j foe consequent bridging interactions present be- 
tween two apposed charged surfaces 107 . 

It is an experimental factP^SGSl ( anc i anv reasonable the- 
ory should account for it) that the ssRNA viruses contain the 
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ssRNA molecule in a thin shell closely distanced from the in- 
terior capsid radius. In the flexible poly electrolyte theory the 
thickness of this adsorbed ssRNA polyelectrolyte layer is of 
the order of a^A The thickness of this ssRNA shell results in 
general from a relatively complicated interplay of all the inter- 
action energies and chain entropy involved in the problem^!! 
the electrostatic contribution to the interactions between neg- 
atively charged ssRNA and positively charged hypotope of the 
capsid and the entropic con tributio n of the constrained poly- 
electrolyte to the free energy^2EE] 

4.1 Electrostatic interactions and energies of ssRNA 
viruses 

The total free energy in this case is composed of the elec- 
trostatic interactions due to ions and charges on the poly- 
electrolyte, as well as the entropy of the polyelectrolyte 
chain Sllini. The electrostatic part of the free energy, F es , con- 
tains the PB functional, Eq. ([2]), augmented by the presence of 
the charges on the polyelectrolyte chain expressed in terms of 
the polyelectrolyte monomer concentration, p(r) and reads^^ 

F a [<Kr)M{r),p{r)] = J f es ^(r),V^r),p(r))d 3 r - 

-pi[fd 3 rp(r)-N), (12) 

where \i is the Lagrange multiplier enforcing the condition of 
fixed number of monomers, N, of the polyelectrolyte chain, 
with 

^((^(r),V(^(r),p(r))=/p 5 ((^(r),V(^(r))-^op(r)^(r). 

(13) 

Here fpB^r) was defined in Eq. [2] and pe$ is the charge per 
monomer, with e$ is the electron charge and < p < 1. The 
part of the free energy due to the entropy of the flexible poly- 
electrolyte canbe approximated in the so-called ground state 
dominance 9 ^ 100 * as 

F ent [p(r),Vp(r)}=k B Tj J d 3 r [ ^^. (14) 

Minimizing the sum of the electrostatic and entropic contribu- 
tions then leads to a polyelectrolyte PB equation that can be 
solved numerically in the spherical geometry of the capsid^l 
Numerical solutions of the polyelectrolyte PB theory are 
complicated but they invariably point to the existence of an 
adsorption layer next to the internal positively charged wall of 
the capsid 111 . This leads to the conclusion that ssRNA should 
be non-uniformly distributed within the capsid, showing a rel- 
atively dense surface layer and a depleted core. The exis- 
tence of the adsorbed layer along the periphery of the capsid 
then engenders the attractive polyelectrolyte bridging interac- 
tions of the type observed to act between planar charged sur- 



facesEEEH ^ k ere act between different parts of the spheri- 
cal hypotope and thus stabilize the protein shell. 



4.1.1 Scaling approach to the ssRNA packing inside a 
capsid. A scaling estimate of the electrostatic free energy of 
ssRNA packing inside a capsid can again be obtained in a suit- 
ably simplified framework and in appropriate range of param- 
eters. It has been found that the optimal virus configuration 
in the physiological regime is such that the total charge on the 
encapsidated ssRNA is comparable to the charge on the cap- 
sid, being equal as the salt concentration decreased 63 ! 90 ! 116 ! 
This is relatively easy to understand. When the charges on the 
capsid and on the ssRNA molecule are equal (but of the oppo- 
site signs), the ssRNA completely screens the protein charges, 
so that the salt ions almost need not to redistribute at all, es- 
pecially when the ssRNA and the capsid can be brought in 
close contact. In all other cases, there is an effective, remain- 
ing charge, that the salt ions must screen by rearranging, in- 
creasing the total electrostatic energy of the system in this 
way. This simple argument is complicated by the fact that 
the ssRNA consists of connected charges - it is a polyelec- 
trolyte molecule. This line of reasoning applies only to the 
total charge on the ssRNA and the capsid. When each nu- 
cleotide is assumed to carry a fixed charge, this also fixes the 
total length of the ssRNA molecule. There are studies, how- 
ever, which predict that not all ssRNA bases carry an elemen- 
tary charge 63 . This depends on the dissociation equilibrium 
and charge regulation^, but it does not importantly influence 
simple energy estimates to be presented below. Similar argu- 
ments can be used als o for en capsidated polyelectrolyte cargo 
different from ssRNAEHESI 

The ssRNA configuration inside the capsid as obtained from 
the full polyelectrolyte PB theory that contain also the poly- 
electrolyte entropy can be approximated by two concentric 
spherical shells with opposite charge, G = 02 ~ — CJi, sepa- 
rated by ~ a (see Fig. [4]). Thus, Eq. ^ should be of use to 
estimate the free energy of such a configuration. The impor- 
tant difference of the ssRNA virus with respect to the dielectric 
slab model of the capsid is that the space between the ssRNA 
and the proteins is permeable to salt and water. However, 
when a < K _1 , the salt induced screening of ssRNA-capsid 
protein electrostatic interaction is incomplete, and the free en- 
ergy of salt ion distribution in between the ssRNA and the 
capsid is small compared to the analogous contributions in the 
rest of the space. 

Thus, a simplest estimate of the electrostatic complexation 
free energy (AFc) of a ssRNA virus in the DH approxima- 
tion can be obtained from Eq. using 8 p « 8 and k8 ~ 1 
(in physiological conditions) and subtracting the electrostatic 
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self-energy of the capsid, Eq. [7] This yields 



2r>2 



AF C * F DH (-a,a,a)-F DH (0,a,0) : 



8 £ 



(15) 

where / is a numerical factor between 4/3 and 2 (it depends on 
K and 8). Note here that the sign of this free energy difference 
depends on the competition between two length scales: a and 

The above result is not very accurate in high salt concentra- 
tions, when K _1 < a and does not include the non-electrostatic 
self-repulsion contribution of the polyelectrolyte which can 
be shown to be smaller then the leading electrostatic contri- 
bution^! Using the "typical" virus parameters and a = 0.5 
nm, we obtain AFc ~ 0, since fa — K _1 ~ 0. A more de- 
tailed calculation 111 gives AFc which is negative and about 
a quarter of the free energy of an empty capsid (at physiolog- 
ical conditions) /*b#(0,G,0), consistent with lower bound on 
/. This signifies that spontaneous encapsidation of ssRNA is 
a delicate process that may even be suppressed in thermody- 
namical equilibrium, so that only empty capsids form. Indeed, 
formation of empty capsids and free ssRNA was theoreticaly 
predicted in the regime of high ionic concentrations, but also 
when the charge on the ssRNA is larger than the charge on 
proteins by a factor of ~ 2 or more 111 . In physiological con- 
ditions, and when the charge on the ssRNA is about the same 
as the charge on the proteins (but of the opposite sign), it was 
found that the complexation free energy is negative so that ss- 
RNA encapsidation takes place. 

4.1.2 Details of the energetics and conformation of ss- 
RNA inside viral shells. A more detailed physical model of 
an ssRNA virus should produce the ssRNA distribution as a 
result of a free energy minimization. Different types of models 
of the ssRNA virus packing have been examined some of them 
emphasizing the discrete nature of the ssRNA^H and some, as 
we have seen in the previous section, representing the ssRNA 
in the continuum limit via the density field p (r) des cribing the 
spatial distribution of RNA "monomers''' 63 ! 1 n l 12Q l Both ap- 
proaches have their advantages and limitations and they nec- 
essarily simplify the physics of ssRNA to some manageable 
model. Going into details of these studies is beyond the scope 
of this review. 

It is of interest, however, to discuss the typical spatial distri- 
butions that are obtained in these models. On panel b) of Fig. 
[4] we show the ssRNA monomer density for several values of 
its length. The results are from Reffl^M One sees that, indeed, 
the ssRNA molecule occupies a shell closely separated from 
the capsid interior surface. As the ssRNA molecule becomes 
longer, and its charge larger from the capsid charge, the inte- 
rior or the capsid gradually fills up. This effect is more pro- 
nounced for higher concentrations of (mono-valent) salt. The 
localization of ssRNA in a shell close to the capsid was also 



found in Refill, in a Brownian dynamics study of ssRNA 
(generic polymer) virus assembly where the electrostatic in- 
teractions were modeled via effective pairwise potentials^!! 
and in molecular dynamics simulation of assembled Satellite 
Tobacco Mosaic virus in Ref.^l. 
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Fig. 4 Panel a): The free energies of the capsid-ssRNA complexes 
as a function of the number of RNA bases and for three different 
monovalent salt concentration, as indicated. The free energy of the 
empty capsid is obtained when N = 0. Panel b): The ssRNA 
(a = 0.5 nm) concentration profile in a capsid of radius R = 12 nm 
and surface charge density a = 0.4 eo/nm 1 for cq=\00 mM. The 
curves displayed correspond to N =100, 300, 500, 700, 900, 1 100, 
1300, and 1500 bases. The lines are styled so that the length of their 
dashes is proportional to N. See Ref. 111 for details. 



It was found that the energetics of ssRNA viruses also de- 
pends on the details of spatial distribution of the capsid charge, 
eop p (r), in particular on its derealization on the capsid pro- 
tein tails that protrude into virus interior^D (see Fig. \5j^\ 

§ Here we see the importance of the electrostatic interactions that is directly 
reflected in the shape of a virus. Furthermore, they probably underlie and 
influence the evolutionary pathway of a particular virus, whose evolutionary 
fitness is to a certain extent determined by the electrostatic interactions that 
govern its (dis)assembly. 
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Fig. 5 One half of the cucumber mosaic (ssRNA) virus capsid 
(strain FNY). The image was constructed from RCSB Protein 
Databank entry 1F15 in the same manner as the one panel c) of Fig. 
[2] One can observe "buttons" of positive charge density formed by 
the tails of capsid proteins protruding into the interior of the virus. 

The viruses that have this feature were found to be more sta- 
ble with respect to changes in the ionic concentration, so that 
the length of the ssRNA that is encapsidated in the thermody- 
namically optimal conditions does not vary significantly with 
the salt concentration ! 63 ! 1 1 . Figure [6] shows the free energies 
of capsid-ssRNA complexes (panel a) and the profiles of the 
ssRNA density (panel b) obtained in the model that, in con- 
trast to results shown in Fig. |4| accounts for the delocalized 
capsid charge, i.e. the effects of protein tails. The free ener- 
gies do not contain the part related to attractive protein-protein 
interaction, so that the empty capsid electrostatic free energies 
(to be compared with e.g. Eq. ([7])) can be read out from the 
figures as the free energies of the complex in the limit when 
number of ssRNA monomers goes to zero. One sees that the 
optimal encapsidated ssRNA length (positions of the minima 
in the complex free energy curves as a function of the ssRNA 
length^ in the case of infinitely thin capsid) is practically fixed 
at the position where the total ssRNA charge equals the cap- 
sid charge (vertical thick dashed line in panel a) of Fig. [6]), 
almost irrespectively of the concentration of monovalent salt. 
One also sees that the thickness of the ssRNA shell becomes 
influenced by the length of the charged protein N-tails, in ad- 
dition to its dependence on the screening length and monomer 
size (compare Figs. [4] and [6]). 

The optimal encapsidated ssRNA length was found to vary 

HThe free energies at these points are estimated by Eq.dl5l. 



much more in the case of infinitely thin capsid (panel a) of 
Fig. [4]), so that at salinities above about 700 mM, the thermo- 
dynamically optimal ssRNA length drops down to zero and 
the model predicts formation of empty viral shells only In 
sufficiently low monovalent salt concentrations (10 mM - 100 
mM), it was found that viruses containing the ssRNA that has 
about two times more charges than the capsid, are still able to 
self-assemble (i.e. that their total free energy is smaller than 
the free energy of assembled empty capsids and free-floating 
ssRNA, see Fig. [6]). 
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Fig. 6 Panel a): The free energy of the capsid-ssRNA complex as a 
function of the total number of ssRNA bases. The capsid radius is 
R=12 nm. The capsid charge is distributed in a spherical shell of 2.5 
nm thickness. Panel b): The ssRNA density profiles for N =100, 
300, 500, 700, 900, 1100, 1300, and 1500. A vertical line at r=9.5 
nm indicates the position of the N-termini of the capsid proteins, i.e. 
the interior surface of the charged protein shell. 

This was confirmed in two quite different studies, one em- 
phasizing the continuum aspects of the ssRNA 111 (the total 
charge on the ssRNA allowed thermodynamically and found 
in this study is about twice the charge on the capsid), and 
the other its discrete features 116 (there the total charge on the 
ssRNA allowed thermodynamically is about three times the 
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charge on the capsid). This means that it is possible that the 
viruses carry effective charge, although the most stable state of 
an ssRNA virus is the one where the charges on the ssRNA and 
the capsid are about the same, i.e. when the effective charge is 
close to zer o - this was found both in continuum approaches in 
R e f s J631ilH and in the approach emphasizing ssRNA discrete- 
ness in Ref EH. 

All the studies discussed thus far agree that the ssRNA 
packages into a shell in the immediate vicinity of the inner 
capsid wall. There is also an interesting question of its or- 
dering 123 . Being very flexible, ssRNA is much more diffi- 
cult t o orde r even in a confined space then the much stiffer 
It seems likely that the spatial inhomogene- 
ity of the capsid charge, obeying the icosahedral symmetry 
of the capsid, may be responsible for ordering of the ss- 
RNA molecu lel^ that has been observed in several exper- 
imentsESli£2J ? but also in simulations in RefJEH. It is also 
of interest to note that the ssRNA may locally form double 
stranded RNA hairpins, increasing the binding energy in this 
way, both via the increase of packing density and occurrence 
of inter-base hydrogen bonding. 

The calculations we presented do not take into account the 
inextensibility of ssRNA, i.e. the fact that the RNA monomers 
are linked together by chemical bonds that can not be stretched 
beyond their chemical size^l In cases of sufficiently long 
and flexible ssRNA, this fact should not influence the energy 
significantly, however, it may be of importance for smaller ss- 
RNA lengths. In such cases, not all monomer density distri- 
butions predicted by continuum calculations can be realized 
by connected monomers, and one has to explicitly account for 
inextensibility as was done in Ref. 127 , by introducing the max- 
imal extensibility constraint in the free energy functional. 

5 Energies of bacteriophages 

The formation of bacteriophages in an infected bacterium pro- 
ceeds by the formation of empty protein capsids first. The 
DNA is then inserted into a (pro)capsid^, an entropically and 
enthalpically unfavorable process^ that can be accomplished 
only via ATP-driven molecular motor 129 . 2 to 2.5 base pairs 
of DNA are packaged by one ATP molecule and there appear 
to be strong electrostatic interactions between the DNA phos- 
phate backbone and the positively charged internal molecular 
motor waflES] it is not entirely clear whether there are also 
significant electrostatic capsidprotein-DNA interaction as we 
have seen in ssRNA viruses N but is an experimentally ob- 



|| The bacteriophages do not enter the bacteria as complete particles, rather, 
they insert their DNA through the bacterial membrane, leaving the capsid 
on the outside. Strong protein-DNA interactions in bacteriophages may be 
disadvantageous with respect to the insertion of the DNA, i.e. they may induce 
"sticking" of the DNA to the interior of the capsid and thus prevent its entry 
through the bacterial membrane. 



served fact that DNA molecule in the bacteriophages is nearly 
homogeneously dist ributed in their interior, so that its den- 
sity is quite uniformEmSl. This is an important difference 
in charge distribution with respect to the ssRNA viruses, and 
the reason for quite different energetics of the two types of 
viruses. The region of DNA close to the capsid shows an 
onion-like ordering, while the DNA in the capsid center is 
typically less ordered^!!. The outer layers most likely have 
a toroidal, spool-like geometry^!, but there are other sugges- 
tions proposing a liquid crystalES a folded toroidEMS a 
folded coaxial spool 137 and the spiral 

fokPSconfi 

durations. 

The physical description of the ordered packing of the ds- 
DNA inside the bacteriophage capsid is usually captured in 
one of the variants of the inverse spool model first invoked in 
the study of the T2 bacteriophage! 1 39 n 141 1 and formalized by 
Grosberg et al. 142 . The inverse spool model has been sub- 
sequently refined by Odijk and Gelbart and coworkersESESD 
and is based on the decomposition of the dsDNA total energy 
within the viral capsid into an interaction term and a bending 
term which lea ds to a reasonable description of the genome 
ejection processEili^J. Apart from the detailed simulation 
approaches to the DNA packing within the capsidEMSI] a rj 
theoretical work is based on assumptions regarding the form 
of the curvature energy of the DNA forced to reside within the 
confines of the capsid, as well as the interactions among the 
highly charged and hydrated DNA segments packed at high 
densities within the capsid. 

Elastic curvature energy appears to be the lesser of the two 
unknowns, though some very recent work might point to the 
contrary^! It is proportional to the square of local DNA cur- 
vature and in fact follows from the Euler-Kirchhoff model of 
an elastic filament. Though this model contains some sub- 
tle features due to the strong interhelical forces between the 
segments of the moleculeEH, it nevertheless appears to be a 
consistent description of DNA^^ on mesoscopic scales^! 
The parameters of the Euler-Kirchhoffian model of DNA, such 
as its persistence length, are well established and have been 
measured by a variety of methods with satisfactory consensus 
among the results^! 



5.1 Electrostatic interactions and energies of bacterio- 
phages 

As the DNA molecule is highly charged (two elementary 
charges per base pair), its insertion in the bacteriophage in- 
terior results in a buildup of repulsive electrostatic force and 
ener gyLL60| electrostatic energy of a rough model where 
the DNA molecule is homogeneously distributed in the capsid 
interior can be estimated in the DH framework using a tech- 
nique similar to that presented for derivation of Eq. ([7]). In the 
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limit kR ^> 1, one obtains 



lim F DH = 

K#»l 



2p z R"% 
3K 2 8 £ ' 



(16) 



where p is the volume density of the DNA charge (2 elemen- 
tary negative charges per base pair). The electrostatic energy 
can now be estimated in the case of X bacteriophage, whose ra- 
dius is R w 30 nm, and whose DNA contains 7^=4 1500 base 
pairs. The density is p ~ 0.7 eo/nm 3 , and the electrostatic en- 
ergy in physiological conditions Fdh ~ 3 10 5 ksT. This is at 
least an order of magnitude larger from the repulsive electro- 
static energy of ssRNA viruses, although the protein-DNA in- 
teraction in bacteriophages may reduce this energy somewhat 
(the attractive protein-DNA contribution scales with R 2 , how- 
ever, so it becomes less relevant for larger bacteriophages). 
The total free energy balance upon assembly can hardly be 
made negative by the attractive protein-protein interactions. It 
then follows that the bacteriophages can not self assemble [**] 
and they must therefore use other means of assembly. 

Electrostatics accounts for only a part of the interaction en- 
ergy between DNA segments. In fact these interactions can be 
measured directly in osmotic stress experiments^^ and can 
be deconvoluted into a longer ranged electrostatic contribu- 
tionES an( i a shorter ranged hydration component 163 . Both 
of them have been quantified in terms of magnitudes and de- 
cay 

lengthsESfl. The various formulations of the inverse spool 
model mostly differ in terms of the exact form of the interac- 
tion potential. While s ome are b ased entirely on theoretical 
polyelectrolyte models! 143 ! 144 ! 165 ! others are based on semi- 
empirical chemical potential expres sions 146 l 1 47 l 149 l 1 5Q I . The 
best strategy would be of course to use directly the measured 
osmotic pressure from the osmotic s tress experiments as an 
input for the formulation of the theory^! 

5.2 Elastic energy of DNA packing 

A simple calculation we used to estimate the electrostatic en- 



ergy of the bacteriophage DNA, Eq. (16), does not take into 
account the intrinsic stiffness of the dsDNA and the estimate 
obtained would be the same were the DNA base pairs com- 
pletely disconnected from one another, i.e. packed homoge- 
neously within the capsid as a cloud of charged monomers. 
The fact that DNA is confined and thus substantially coiled 
brings its bending energy clearly into focus. The bending elas- 
ticity of DNA can be accounted for within the Euler-Kirchhoff 

**The attractive energy of protein-protein interactions scales with the total 
length of protein contacts (buried surface area), i.e. with R 2 . In case of 
ssRNA viruses, the repulsive electrostatic energy, for given surface charge 
density scales also with R 2 , which suggests that ssRNA viruses of similar sur- 
face charge densities but different radii (i.e. made of similar proteins, but of 
different T numbers) can all rely on attractive protein-protein interactions in 
order to self assemble. This is very unlike the case of bacteriophages where 
the electrostatic energy of DNA, for given packing density scales with R 3 . 



elastic energyE^L^ and introduces an additional length scale, 
Lp, to the problem, referred to as the persistence length^] 
The persistence length represents the length scale over which 
the direction of the DNA is correlated and was measured to be 
about 50 nm in physiological conditions 1 68 l 169 l. One should 
note here that this number includes also an electrostatic con- 
tribution to the DNA elasticity, i.e. the fact t hat cha rges on 
an elastic string also contribute to its elasticity 168 1 170 ! , as they 
want to remain as distant as possible, favoring thus flat DNA 
strand conformations. One can describe this as an electrostatic 
renormalization of the "bare" (chemical) DNA elasticity, i.e. 
the elastic ity that th e DNA strand would have were its bases 
uncharge d' 157 ^ 17 1 I The bare persistence length, Lp is thus 
smaller than 50 nm. 

To estimate the elastic contribution to the DNA pack- 
ing, one ca n use techniques similar to those presented in 
Refs | i73|i74 | The elastic 

energy of packing can be expressed 
in different equivalent continuum forms^^. In what follows 
we use the form 



^elastic 



£pLpp 



k R T / d 5 r 



PBp(r) 
R 2 (r) ' 



(17) 



where Lp is the persistence length of DNA, Lpp = 0.34 nm is 
the spacing between the DNA base pairs, ppp(r) is the DNA 
base-pair density (the charge density is thus p(r) = 2eop bp (r)) 
at a radial distance r from the axis of DNA packing, and R 2 (r) 
is the squared radius of curvature of DNA, the assumption 
being that the configuration of DNA orientational order has 
cylindrical symmetry. 

The elasticity of DNA should induce an inhomogeneity in 
the DNA packing density, psp(r), which would be constant 
were the electrostatic energy the only part of the total free en- 



ergy. Looking at Eq. 17 one sees that the elastic contribution 
diverges as the radii of curvature approach to zero. Assum- 
ing that the DNA is packed in a bacteriophage capsids in a 
spool-like manner, the regions which are most disfavored with 
respect to elastic energy are those close to the spooling axis, 
where the radius of curvature, R, is small, decreasing to zero 
on the axis. A reasonable guess of p(r) (or variational ansatz) 
that includes the effects of elasticity would be a homogeneous 
density of DNA outside a cylinder of radius Ro, that drops to 
zero inside the cylindrica l void, i.e. the zone of DNA exclu- 
sion due to elastic effects 174 175 . This approach is amenable 
to analysis, as it allows one to combine the elastic and electro- 
static energies of a DNA distribution with exclusion radius Ro, 
and to find the radius, requiring that the total energy be mini- 
mal. The result in the case when kR > 1, Ro <C R, N B p > 1, 
R = 30 nm (bacteriophage X), and Lp = 50 nm (maximal esti- 
mate for bare DNA persistence length) is^ 



10R 



(18) 
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This equation indicates small effects of elasticity on the dis- 
tribution of packed DNA. For fully packed bacteriophage X, 
Nbp = 41500, and the exclusion radius amounts to Ro ~ 0.057?. 
This is smaller (1.5 nm) than the diameter of the DNA strand 
(~ 2.5 nm) which means that the exclusion void, if it exist, is 
determined by the discrete nature of the DNA molecule, and 
the minimum possible length of a DNA kink. Note also that 
the value of the persistence length that we used in the calcu- 
lations above (50 nm) includes some of the effects of electro- 



static interactions that have already been included in Eq. ( |T6| ), 
so that the void radius is expected to be even smaller. 

I t is als o possible that nonlinear elastic effects and knot- 
ting! 35 1 176 1 ma y bp f importance for such a highly bent DNA 
configuration-Gil. The relative contribution of elastic energy 
to the total energy of packing in this model scales with Nbp 
as IyiNbp/Nbp. In the case of fully packed bacteriophage, it 
amounts to less than 5 % of total energy 62 . A similar es- 
timate has also been obtained on the basis of experimental 
data in Ref. 30 . Smallness of elastic contribution to the total 
energy has been discussed in RefM^, using different tech- 
niques that do not attempt to estimate the electrostatic con- 
tribution to the DNA packing but rather utilize the data from 
bulk DNA osmotic pressure experiments^^! in order to obtain 
the required information. The reg ion of depleted DNA density 
is not seen in experiments 1 3 1 1 178 [ Rather, a more disordered 
phase of DNA is seen around the center of the capsid, which 
does not have an "onion-like" type ordering characteristic for 
DNA packed close to the capsid interior. 

The contribution of DNA elasticity to bacteriophage pack- 
ing energy and the inhomogeneities in the density of the en- 
capsidated DNA ar e both q uite small when the DNA is in a 
non-condensed statePSEH] i >e# when the interactions between 
the DNA segments are repulsive. In fact this is always the case 
either when the counterions in the bathing solution are mono- 
valent and/or there are no crowding agents - such as poly- 
ethylene-glycol - present in the bathing solution. Monovalent 
counterions can change the strength of intersegment DNA in- 
teractions but they can not change their character, i.e. they can 
not turn repulsive interactions into attractive interactions'^. 

Monovalent salt ions in general renormalize the bare phos- 
phate backbone charge along DNA as well as determine the 
range of electrostatic interaction via their screening properties. 
The electrostatics of the DNaE^EO] becomes complicated in 
the presence of strongly charged counterions, basic proteins, 
crowding agents, the DNA confinement, etcM^ . One of the 
salient features of these complicated interactions is that adding 
even small amounts of polyvalent counterions (spermine, sper- 
midine, CoHex, ...) changes the nature of the electrostatic in- 
teractions so that they can turn attractive even between nomi- 
nally equa lly charge d DNA segments where one would expect 
repulsion s 30 ! 183 ! 18 ^ . The same effect can be observed also in 
the bacteriophage with a partially ejected genome 131 where 



addition of spermine condenses the remaining DNA into a 
toroidal spool completely contained within the bacteriophage 
capsid. In this case the contribution of the elastic energy to the 
equilibrium free energy is non-negligible being in fact essen- 
tial in order to counteract the attractive interactions that would 
tend to compact DNA into a disordered globule. 

6 Osmotic pressure in viruses 

Since solvent can equilibrate across the viral capsid the me- 
chanical pressure acting on the capsid wall equal s the os motic 
pressure of the solution enclosed within the capsid 185 ' 186 . This 
osmotic pressure has various contributions that differ in the 
case of flexible ssRNA viruses as opposed to the stiff dsDNA 
bacteriophages due to the nature of the polymer elasticity. In 
the former case we have purely entropic elasticity stemming 
simply from the connectivity of a flexible chain, while in the 
latter case we have enthalpic elasticity of a semiflexible Eule- 
rian filament very different in form and magnitude^!] 

The differences in molecular flexibility of the encapsidated 
nucleic acid component furthermore engender also qualita- 
tive differences in the ordering of the genome. In the case 
of ssRNA its flexibility implies, in the simplest case, a com- 
pletely disordered polymer solution of homogeneous den- 
sity as can be observed experimentally with e.g. hyaluronic 
acid 187 , while in the case of dsDNA at high concentrations its 
stiffness l eads to an orientationally and positionally ordered 
mesophas d 188 ' 189 i This idealized dichotomy ceases to be valid 
in the context of viruses at special conditions when ssRNA can 
assume a more ordered hypotope-bound configuration reflect- 
ing the icosahedral symmetry of the inner capsid wall^l The 
packing of dsDNA within bacteriophages seems to be gov- 
erned by the stiffness of DNA that leads to the formation of 
local nematic alignments 134 whose orientational order is con- 
strained by the bacteriophage capsid. The orientational order- 
ing of dsDNA within the bacteriophage can thus be seen as a 
type of constrained liquid crystalline ordering that c an be ana- 
lyzed by a local thermodynamic mesoscopic theory 124 I 125 I 19Q I . 

In a thermodynamic equilibrium osmotic pressure at every 
point within the capsid has to be t he sam e and can be set by 
external osmoticants such as PEG 191 ' 192 i For various mod- 
els of the viral core comprised of the nucleic acid component 
and the bathing ionic solution, the total osmotic pressure can 
be decomposed into separate terms stemming from the differ- 
ent components of the bathing solution. Within the mean-field 
approximation of the ionic component of the bathing solution 
it follows that its osmotic pressure is given by the van't Hoff 
ideal osmotic pressure of all the ionic components^ at the hy- 
potopic wall of the capsid. As for the nucleic acid component 
it is given by an additive term due to either the entropic elas- 
ticity of a flexible chain^' or enthalpic elasticity of a semi- 
flexible chainQ22. 
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Fig. 7 Pressures sketch for an ssRNA [Panel a)] vs dsDNA virus [Panel b)]. Genome is shown in blue, capsid in red, arrows indicate the 
direction of pressure. Cross section through the capsid. In the ssRNA case the genome is disordered but shows a distinct boundary layer close 
to the hypotope of the virus with a depleted region in the middle. In the dsDNA case the genome density within the capsid is (almost) uniform, 
showing pronounced orientational and positional ordering. 



Specifically, the case of flexible polyelectrolyte chain(s) 
such as ssRNA in an ionic solution the total osmotic pres- 
sure decouples into two contributions^^ the first one is the 
osmotic pressure of ions inside the capsid due to electrostatic 
interactions between the ions themselves, as well as between 
the ions and the polyelectrolyte and the capsid charges; within 
the PB frame it equals the ideal van't Hoff expression for 
all the components of the solution, except the polyelectrolyte 
chain, evaluated at the inner surface of the capsid wall; the 
second one takes into account the connectedness of the poly- 
electrolyte chain and its interaction with the mean electrostatic 
field. While the first term is positive, the second one can be 
of either sign. When it becomes negative we refer to it as the 
polyelectrolyte bridging contribution 107 . Neither of the two 
terms is in general dominant and the overall sign is due to a 
subtle interplay of electrolyte and polyelectrolyt e properties of 
multicomponent solution within the virus capsidl 102 ' 106 ' 107 ' 

For the stiff dsDNA polyelectrolyte a different decomposi- 
tion can be derived^! In that case the osmotic pressure again 
decouples into two terms: the first term now corresponds to 
all the self-interactions between the segments of the encapsi- 
dated DNA moleculeM^J whereas the second one is the contri- 
bution of the elastic bending deformation of th e ordere d DNA 
mesophase imposed by the capsid confinement! 1 24 l 1 93 [ While 
the second term is strictly positive, since any deformation of 
the orientation of the DNA molecule increases its free energy, 
the first one can be of either sign depending on the nature of 
interactions between DNA molecules 162 . In principle it con- 
tains all interactions between DNA molecules that could be 
either electrostatic or non-electrostatic in nature. Comparing 



the measured interaction term with the curvature term one can 
make a general conclusion that the latter is important only in 
the immediate vicinity of the central axis of DNA packing 
symmetry, consistent with the model calculations of bacterio- 
phage energy presented in the previous section. 

We now estimate the magnitude of the osmotic pressure in 
both types of viruses by making a very restrictive assumption 
that the interactions are strictly electrostatic in origin. This 
allows us to use simple scaling arguments in the derivation. 

6.1 Osmotic pressure in self-assembled ssRNA viruses 

The scaling form of the polyelectrolyte osmotic pressure, ppe, 
acting in ssRNA viruses can be obtained from equation Eq. 
( [T5] ) if one assumes that electrostatic interactions can be de- 
scribed on the DH level and that confined ssRNA distribution 
can be described as a dense hypotopal layer of thickness a and 
a depleted core. One first needs to express the equation for 



the virus free energy Eq. 15 within the DH approximation in 
terms of the total charge on the capsid (Q) and on the ssRNA 

(~-G), 



Fdh ~ F DH (-G,a,a) 



fQ 2 a 
167? 2 718 £* 



(19) 



The osmotic pressure of the polyelectrolyte (PE) chain can 
then be calculated from the appropriate derivative of the free 
energy at constant number of surface charges, 



1 dFj 
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DH 



4R 2 n dR 



(20) 
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The sign here corresponds to the standard definition with pres- 
sure being positive, i.e. repulsive, for free energy that decays 
with radius. This gives 



Ppe 



fQ 2 a 
32R 5 % 2 E e' 



(21) 



For a "typical" ssRNA virus, this gives ppe ~ 10 atm, acting 
to increase the capsid radius (outward). There is, however a 
component of pressure missing in the above evaluation. This 
is the component related to attractive protein-protein interac- 
tions in the capsid. 

Namely, there is an outward pressure acting in the empty 
capsid also, since in this case the charges on the capsid de- 
crease their energy by separating. On the DH level this purely 
electrostatic (ES) component of osmotic pressure is then given 
by 



empty 

Pes 



1 dF^ pty 



4R 2 % dR 



Q 
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, (22) 



where F^ py is the free energy of an empty capsid given in 
Eq. ([7]). If we assume that the empty capsid of an ssRNA 
virus in question can self-assemble, we are led to propose that 
the repulsive electrostatic pressure in empty capsids is exactly, 
or to a good proportion canceled by pressure arising from the 
attractive interactions. Were the proteins infinitesimally small 
particles, the two pressures should exactly cancel in the as- 
sembled empty capsid. 

If we include the attractive protein-protein interactions in 
the calculation of total pressure, it is then given by 



P = Ppe + pes ' 



Q 2 



32# 5 7t 2 £o£ 



(fa- 



(23) 



Since fa ~l nm, and K _1 ~ 1 nm, we see that the total pres- 
sure in ssRNA virus almost vanishes, since the different con- 
tributions nearly perfectly cancel. 

In more detailed calculations of osmotic pressure 111 , it was 
found that the pressure in ssRNA virus can be either positive 
or negative, depending on the total charge on the virus and its 
relation to other parameters in the problem, see Fig. [8] but it 
was found to be slightly negative (half an atmosphere) at the 
point of optimal assembly, i.e. at the point where the length of 
the ssRNA is such to minimize the complexation free energy 
functional. 

The negative values of osmotic pressure can be understood 
by analogy with charged planar surfaces with an oppositely 
charged poly electrolyte chain in between and are due to bridg- 
ing configurations of the chain^l One could then connect the 
negative values of the osmotic pressure in the case discussed 
here, with a similar mechanism. The ssRNA enclosed within 
the capsid would thus bridge the space between neighboring 
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Fig. 8 Panel a): Osmotic pressure acting on the viral capsid as a 
function of the number of monomers. Panel b): Osmotic pressure 
for fixed number of monomers N = 420, co= 100 mM [the point 
denoted by a circle in panel (a)] as a function of the capsid charge 
density, a. The capsid is assumed to be infinitely thin, with radius of 
R=\2 nm. Other parameters of the calculation are the same as in 
Fig. [^(adapted from Ref ECO). 



sections along the curved capsid and induce attractive inter- 
actions between them. Summing these attractions along the 
total surface of the capsid would give rise to an overall neg- 
ative osmotic pressure of the poly electrolyte. This interpre- 
tation is certainly corroborated by the characteristic density 
profile of the polyelectrolyte concentration in the vicinity of 
the capsid wall, which is in fact very similar to the bimodal 
polyelectrolyte concentration pro file obse rved in the case of 
planar polyelectrolyte confinement 1Q2 I 1Q6 [ 



6.2 Osmotic pressure in bacteriophages 

In the case of the dsDNA bacteriophage the osmotic pressure 
due to DNA self-interaction is in general much larger than the 
one due to the DNA elasticity, which we can thus safely ig- 
nore. Assuming furthermore, that the only interaction between 
DNA molecules is electrostatic we can estimate its magnitude 
with scaling arguments. There is nevertheless large variation 
in the magnitude of the osmotic pressure for various virus 
types even if it is mostly of electrostatic nature^! 

Osmotic pressure arising from the dense packing of DNA in 
bacteriophages can be obtained by examining how the electro- 
static energy of encapsidated DNA changes upon the increase 
of the capsid radius. The energy should be expressed in terms 
of the total number of DNA base pairs which is conserved in 
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the process of infinitesimal increase of the radius, i.e. 



1 dFj 



DH 



4R 2 % dR 



N BP 



Using Eq. ( 16 ) in the limit of kR ^> 1 this yields 
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(24) 



(25) 



which to the lowest order coincides with the estimates based 



on the Donnan potential ^ 44 l 1 65 1 1 95 1 Osmotic pressure is thus 
positive corresponding to a repulsive force acting on the capsid 
wall trying to increase its radius. The above equation can be 
evaluated for bacteriophage X and it gives p ~ 100 atm, which 
is somewhat higher than measured in experiments^! On one 
hand, this is the pressure that the bacteriophage capsid needs 
to withstand, and on the other this is the pressure of the DNA 
coiled osmotic spring piled up against the inner surface of the 
capsid ready to release its chemical and mechanical energy 
through the portal complex on docking onto a bacterial wall^l 
For real bacteriophages at a dsDNA concentration corre- 
sponding to ~ 2.7 nm inter DNA spacing, which amounts to 
~ 500mg/ml, the osmotic pressure is given not only by the 
electrostatic interaction between DNA molecules'^ but also 
by the hydration interactions stemming from the ordered vici- 
nal layers of water close to the DNA surfaceC52. 

6.3 Osmotic pressure: ssRNA viruses vs. dsDNA bacte- 
riophages 

Osmotic pressure of the two types of viruses'^ differs cru- 
cially in its magnitude as well as sign. While for ssRNA it 
is usually small and could be negative, it is strictly positive 
and large for bacteriophages. Negative osmotic pressures are 
ubiquitous in polymer shells that are impenetrable to osmoti- 
cants L!97]Q29] diff erence f osmotic pressures also mirrors 
the very different assembly paths of the two types of viruses: 
spontaneous self-assembly at almost vanishing osmotic pres- 
sure difference vs. non-equilibrium (active) packing mech- 
anisms where DNA encapsidation has to fight an enormous 
positive osmotic pressure inside the capsid^^. These salient 
features of osmotic pressure carry on even into more realistic 
models of interactions between the nucleic acid component of 
the viral core 162 . 

The scaling forms of electrostatic interaction contribution 
to osmotic pressure of ssRNA and dsDNA viruses also convey 
another interesting contrast between the two. While in the 
case of the dsDNA bacteriophages the electrostatic interaction 
energy stems largely from the interactions acting between the 

segments of nucleic acid component within the volume of the 

WT 



virus , in the case of the ssRNA viruses the most important 
contribution comes from the interaction between the ssRNA 



genome and the hypotopic surface of the capsid. This contrast 
in the interaction mechanisms shows up eventually as the R~ 6 
scaling for DNA viruses vs. the R~ 5 scaling for the ssRNA 
viruses in the simple picture introduced above. 

7 Electrostatic effects in viruses for highly 
charged counterions 

Many bacteriophages require Mg 2+ , Zn 2+ and Ca 2+ ions in 
order to attach to bacterial membrane 14 . Spermine (Spm 4+ ) 
and other multiply charged polyamines have been reported 
in a variety of viruses^! Arginine rich protamines are es- 
sential for sper m head c ondensation and DNA stabilization in 
spermatogenesis'^^^ and the effect of long polycations such 
as poly-L-lysine, poly-L-arginine and linear and branched 
polyethylene imine on interactions between DNA molecules 
have been studied in detail 1 205 | 2Q6 1 

The electrostatic interactions mediated by the polyvalent 
counterions can not in general be discussed within the mean- 
field ansatz and their presence has consequences that do not 
have their counterpart in the monovalent casein. We now un- 
derstand that polyvalent counterions even in mM concentra- 
tions mediate a different type of electrostatic interactions than 
the monovalent counterions' 40 ! 41 ' They can drastically modify 
either interactions between the nucleic acid component of the 
virion, or the interaction between the capsomere subunits of 
the capsid. In both cases their effect could most probably be 
ascribed to a diminished repulsion or indeed net attraction be- 
tween the similarly charged macromolecular subunits^! In 
the context of virus electrostatics these effects have not been 
analyzed yet and can be of several types that we address sepa- 
rately. 

7.1 Specific binding of polyvalent counterions 

Ions even of the same net charge have many properties which 
are specific for each ion type 47 . It is well known that their in- 
teraction with the DNA surface is not governed by the charge 
only but is a complicated combination of their size, hydra- 
tion and polari zation properties'^. Ion specific effects are 
quite important ! 183 ' 184 ' but are notoriously difficult to under- 
stan( j|209|2io] Specifically in the context of DNA interactions 
the nature of the ions sets the magnitude of the repulsive elec- 
trostatic interaction in the case of monovalent counterions^^ 
as well as the emergence of counterion mediated attractions 
leading to the onset of DNA condensation"^. At present the 
best one can do is to include the ion specific effects on some 
kind of a phenomenological level as is the case when dealing 
with counterion adsorption along DNA 181 . 

Ionic specificity becomes particularly important when ana- 
lyzing the details of the DNA ejection mechanisms 213 or cap- 
sid protein assembly and stability^! Capsid proteins often 
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contain Angstrom- sized "voids", holes, i.e. three-dimensional 
empty spaces which act as geometrical sieves, enabling entry 
of ions of small enough size^. When the void is surrounded 
with a particular distribution of protein charges, the void may 
act also as an electrostatic sieve, permitting entry and local- 
ization of only sufficiently charged ions of proper effective 
size. In such a localized configuration of counterions and the 
protein, even quantum effects may be important, which may 
further differentiate between specific electronic structure of 
different ions. All this may result in a specificity of counte- 
rion binding. Note that the mechanism discussed gives rise to 
a localization of the counterions, an effect that is not included 
in the PB and DH approaches. This results in effective bind- 
ing of the counterion in the holes formed by charged protein 
groups 81 , an effect that in the context of protein interactions 
is usually known as coordinate bonding and involves di- and 
trivalent ions of Fe, Zn, Co, Ca, Mg, as well as other metallic 
ionsES. 

The role of such ions may be important both for establish- 
ment of the capsid protein fold (i.e. the functional configura- 
tion of the capsid protein or proteins in general 215 ), and also 
for the binding of proteins to form a capsid. In this respect, 
the presence of these ions may act as a switch, yielding an ef- 
fective attraction between the proteins when the (multivalent) 
specific counterions are added in required (small) concentra- 
tions. Attractions of this type can be invoked also in other 
contexts of the nano-scale interactions 216 . Thus, these effects 
that can be quite difficult to quantify can be considered as con- 
tributing to the attractive component of the protein-protein in- 
teractions, and the approach presented still serves to establish 
the importance of monovalent ions in the repulsive electro- 
static interactions between the proteins. 

7.2 "Dressed counterion" approximation: nonspecific ef- 
fects of polyvalent counterions 

As we already stated electrostatic interactions in the presence 
of polyvalent counterions or highly charged macroions can not 
be properly understood on the mean-field level. Sometimes 
even the sign of the interaction predicted by the mean-field 
theory is wrong. While this shortcoming of the PB theory has 
been known in the general colloidal context since the mid- 
eighties^^H it made its debu t in the study of DNA inter- 
action only about 10 years later 219 ! 220 ! The salient features, 
not the ion specific effects, of the polyvalent counterion me- 
diated interactions can be formulated analytically based on 
the strong coupling electrostatics that has been pioneered by 
Rouzina and Bloomfield 221 , elaborated later by Shklovskii et 
a i [222] L evm qi a i J223J an( j i3 r0U ght into final form by Netz et 
a i | 4i|224 | ^ n important feature of the strong coupling regime is 
that the counterion mediated interactions between nominally 
equally charged macroions can become attractive and can in- 



deed cause condensation of DNA^^SI. 

As the bathing solution of viruses often requires the pres- 
ence of polyvalent counterions, the question remains of the 
nature of the polyvalent counterion effect in the electrostat- 
ics of viruses, interactions between DNAs notwithstanding. 
We are not addressing the specific binding effectsESESl b ut 
the non-specific universal features of the polyvalent counteri- 
ons. These effects have not been analyzed in detail yet, but 
there are some features that can be addressed without delv- 
ing too deeply into the theory of strongly charged Coulomb 
systems. Usually, the aqueous solution of viruses contains a 
mixture of monovalent as well as polyvalent salts at relatively 
high and low concentrations, respectively. An intermediate 
approximation termed the dressed counterion (DC) approxi- 
mation allows for an approximate treatment of all the com- 
ponent s in thi s highly asymmetric multicomponent electrolyte 
syste m 228 l 229 [ It starts with the observation that the concentra- 
tion of polyvalent counterions is usually small and thus can be 
dealt with on the lowest order virial expansion level, whereas 
the monovalent salt ions can still be treated on the mean-field 
DH level. 

Within the DC approximation the electrostatic free energy 
of a charged shell is composed of two parts: the first one is 
the DH free energy that we already evaluated before, Eq. ([7]), 
and is due to the electrostatic interaction between the charges 
on the capsid and the monovalent salt ions of the bathing 
ionic solution. The second part is due to the presence of a 
small concentration of polyvalent counterions interacting via 
the DH screened potential. In the limit of small concentrations 
and highly charged counterions the contribution of the poly- 
valent counterions can be well approximated by the first order 
virial expansion corresponding to a single counterion interact- 
ing with all the charges in the system, i.e. the monovale nt salt 
ion charges as well as the fixed charges on the capsidlSHESD 
The grand canonical potential can thus be written 

(3<D = VF DH -£F$ = 

= $F DH -n DC / exp(-Pe v(b//(r))d 3 r, (26) 
Jv 

where nr>c is the bulk concentration of the added v-valent con- 
terions. In a typical case of virus assembly, the solution con- 
tains relatively large concentration of monovalent salt (~ 100 
mM) and by comparison a small concentration of v-valent ions 
(~ 1 mM). The electrostatic potential ^dh is again the solution 
of the Debye-Hiickel equation for the given system. 

The DC part of the total electrostatic free energy contribu- 
tion scales quite differently with the capsid surface charge 
density, a, and the DH screening length, k, than the mean- 
field contribution Fdh [Eq. ([?}]. This is also the case when 
the DC calculation is performed for a sphere homogeneously 
filled with charge, which is a simple model of a bacteriophage 
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[with the free energy in the DH limit given by Eq. ( [T6| )] . Since 
the v-valent ions and the bulk/surface charges are usually of 
opposite signs, the DC part of the total free energy acts to 
lower the DH contribution, with the possibility of eventually 
completely taking over - if this happens, the total electrostatic 
free energy changes sign. We demonstrate this effect on a 
model of a bacteriophage and summarize our results in Fig. [9] 




R [nm] 

Fig. 9 The lines separating the positive (above) and negative 
(below) values of the total free energy of a model bacteriophage 
with p = —0.7 eo/nm 3 , obtained in the dressed counterion 
approximation. Above the lines the DH term dominates, whereas 
below the lines the DC term takes over. The bacteriophage is 
assumed to be in a bathing solution that contains 1 mM of v-valent 
salt, in addition to cq of mono-valent salt. Full line for v = 4 and 
dashed for v = 2. 

The boundary line shows where the change of the sign of 
the free energy happens, i.e. the region below the two curves is 
where the DH approximation that neglects the non-mean- field 
effects of v-valent counterions may fail. At higher monovalent 
salt concentrations or smaller bacteriophage radii the DC part 
is much smaller than the leading DH term, meaning that the 
behavior of the total free energy is almost completely deter- 
mined by it. Below the boundary line, the DC term becomes 
the dominant one, with the total free energy increasing very 
rapidly with lower salt concentrations or larger radii. Increas- 
ing the bulk charge in general means increasing the regime 
where the DC part dominates. One should, however, exercise 
some care in interpreting these results. The dressed counterion 
approach is tailored for description of multivalent counterions 
with v ^> 1 and vanishing concentration, so that it may overes- 
timate the effects pertaining to 2-valent or even 4-valent coun- 
terions. Yet, the model results that we presented do indicate 
that the strong electrostatic interactions engendered by poly- 
valent counterions may be important for viruses, especially in 
the conditions of poor screening by the bathing monovalent 
salt. 

The osmotic pressure of the virion that can be deduced from 



the above discussion of the free energy behavior can become 
less repulsive or indeed shows an emerging attractive, i.e. neg- 
ative, component as a function of the volume density of the 
encapsidated DNA charge, p. This component has the same 
origin as the attraction observed between two charged macro- 
molecules mediated by polyvalent counterions^Sl and is due to 
counterion correlations in the strong electrostatic field of cap- 
sid charges. It thus acts to stabilize the capsid and diminish 
the osmotic pressure acting within. 

8 Relevance of physical insight for architecture 
of real viruses and their "lifecycle" 

We have seen that the balance of charges and the electrostatic 
energy is important for the virus stability, its assembly and 
function. This physical fact restricts the space of virus vari- 
ability, i.e. the types of proteins encoded by its genome. Of 
course, not all proteins encoded by the genome will be good 
candidates for a capsid - some of them will not be able to 
form a capsid for sterical reasons. Yet, some of them may 
easily form empty capsids in the conditions of thermodynam- 
ical equilibrium, but due to inadequate total charge, or its spa- 
tial distribution, cannot encapsidate the RNA molecule. This 
depends on the amino acid content of the virus protein, and 
the spatial distribution of charge-carrying amino acids. Thus, 
physical reasons encoded in the thermodynamics and free en- 
ergy of the assembled virus importantly reduce the landscape 
of possible virus mutants. This is especially important in self- 
assembling ssRNA viruses. In this case the total charge on 
the s sRNA and in the capsid need to be in a well defined re- 
lationE3nHnSli n order to prevent the assembly of empty cap- 
sids in thermodynamical equilibrium. The stabilizing effects 
of the nucleic acid in this case transpires also through the nega- 
tive values of the overall osmotic pressure dur to the polyelec- 
trolyte bridging effects of ssRNA. The length of ssRNA may 
thus act as a regulator of the capsid size, gathering sufficiently 
large number of proteins in order to screen its charge, i.e. to 
bring the total charge of the assembled virus within the borders 
enabling a spontaneous assembly (Brownian dynamics studies 
of ssRNA (generic polymer) virus assembly which account for 
the electrostatic interactions via model pairwise interactions 
can be found in The electrostatic interactions be- 

tween the ssRNA and the proteins can thus be viewe d as the 
reason for the characteristic size of the virus 111 230 PI 

The calculations we presented predict that there is a poly- 
electrolyte depleted region in the center of typical ssRNA 
viruses. One may wonder whether this depleted region can be 
partially or completely eliminated in viruses of smaller radii. 



ff A smilar effect has been experimentally foundl 2 -^ and theoretically dis- 
cussed 127 in context of assembly of virus proteins around functionalized (neg- 
atively charged) nanoscopic golden cores. 
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Such viruses may indeed be viable, but note that they would 
need to carry more charge on the capsid proteins. Imagine, for 
example, the evolution of an ssRNA virus whereby its proteins 
change and instead of e.g. T = 3 capsid they form a T = 1 
capsid Q Presuming that the ssRNA is not much shortened in 
this drastic evolutionary event and that it carries similar charge 
as before, the proteins would need to carry three times more 
charge in order to pack the ssRNA with the same efficiency 
as before (here we neglect the change in entropic component 
of the confined ssRNA molecule). It seems reasonable that it 
would be more difficult to reconcile the stability of the fold 
of such highly charged proteins with the requirement that the 
proteins, once folded, interact attractively with their neighbors 
in a formed capsid, i.e. that they expose required amino-acids 
in contact regions^. The electrostatics of individual proteins, 
the proteins in capsid and in contact with ssRNA may thus 
conspire to impose restrictions on virus size. 

The total charge on the DNA, i.e. its total length, seems to 
be a decisive factor concerning the stability of bacteriophages. 
Mutant phages with genomes longer or shorter (deletion mu- 
tants) from the wild type virus were studied experimentally 
(see Ref.l^and references therein). Intriguingly, it was found 
that the deletion mutants were more resistant to changes in the 
environment, particularly to heat shock. The mutant phages 
with genomes longer from the wild type variant were, on the 
other hand, found to be highly unstable^^. The consideration 
of electrostatic forces can explain these experimental findings, 
as the internal pressure in the shorter genome (less charged) 
mutants will be smaller. This may lead to improved mechan- 
ical stability of such mutants, but this does not necessarily 
mean that such mutants have evolutionary advantage. Viruses 
need to be sufficiently stable physically, but they must, in ad- 
dition, (i) contain all the information needed for their repli- 
cation, and (ii) dissasemble efficiently in appropriate condi- 
tions. One may speculate that the longer genomes may offer 
an evolutionary advantage as they enable the coding of addi- 
tional proteins, which may be used for different purposes, e.g. 
as scaffolding proteins ^ 234 ! enabling a more precise (and pos- 
sibly quicker) assembly of viruses in infected cells. On the 
other hand, the need for packing the additional "information" 
may render the virus unstable. The virus evolution is thus di- 
rectly influenced by physical constraints on the molecular and 
nano- scale levels. This is of course true for evolution of all 
organisms, but it becomes particularly transparent in the case 
of simple macromolecular assemblies such as viruses. 

It was found experimentally that the size and mass of bac- 



* Some viruses are dimorphic when assembled without the genome molecule. 
To observed the dimorphism a modification of the capsid protein is sometimes 
required. C-termini modified (shortened) capsid proteins of hepatitis B virus 
have been experimentally observed to form T = 3 and T = 4 capsidsl^l The 
percentage of T = 4 capsid was found to vary, depending on the amount of 
shortening of the C-termini^^ (see also RefJ^O). 



teriophage capsids are highly correlated with the genome 
s j ze |233] jfa s can a g am explained from the electrostatic 
considerations, as the phages with longer genomes need larger 
capsids in order to reduce the electrostatic repulsion in the 
functional (filled) virus. Interestingly, the same type of cor- 
relation does not hold for ssRNA viruses^SSIl, as the elec- 
trostatic interactions there scale differently with capsid radius, 
and the total interior capsid charge is a decisive factor that 
determines the ssRNA lengtrJSDn (see Figs. [4] and [6]). The 
observed differences in "architectures" of ssRNA and bacte- 
riophage viruses are also a consequence of their "lifestyles", 
in particular the fact that many ssRNA viruses self-assemble, 
in contrast to bacteriophages which do not. 

9 Conclusions 

We have shown that the size and architecture of viruses are 
importantly constrained by the (nonspecific) electrostatic in- 
teractions acting between the constituents of a virus, i.e. its 
proteins and its genome molecule. A rough understanding of 
these physical constraints can be grasped from simple mean- 
field expression for the electrostatic contribution to virus en- 
ergy (Eqs. ([7]) and ( [T6| )) and pressure (Eqs. ( [23] ) and (25 0- A 
more detailed and quantitative argumentation would also have 
to take into account that electrostatic interactions are only part 
of the whole picture and that e.g. ion specific effects together 
with hydration interactions put finer details on the conclusions 
reached above. Unfortunately these effects are much less un- 
derstood and do not allow for simple quantitative estimates as 
is the case with electrostatic interactions. 

Viruses are information encoding machines and most mi- 
crobiological/virological studies of viruses concentrate on this 
aspect of viruses (see e.g. Refill). Indeed, the viral genome 
must encode all the proteins required for the life-cycle of a 
virus and missing in the cellular environment. The typical 
length of ssRNA genome is often discussed in terms of repli- 
cation fidelity2E23 xhe rate of mutation of ssRNA viruses'^ 
is about 1(T 4 to 1(T 5 errors per nucleotide per round of copy- 
ing which means that viruses in the daughter generation typ- 
ically con tain o ne mutation when compared to the parental 
generatio n 2 l 236 [ This replication error rate is often considered 
as the factor restricting the genome length that can be faith- 
fully maintained. We have shown that the genome length is 
also a physically important quantity that should be examined 
in context of the size and charge distribution of virus cap- 
sid (phenotype). This means that the genome length is con- 
strained additionally by the physical interactions conveyed by 
the genome molecule when packed with proteins it encodes. 
We find here an interesting interplay of non-physical (non- 
material) and physical/structural aspects of the information 
encoded in a genome molecule that must be tuned properly 
in a functional virus. 
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